Automatic Parameter Tuning of Three-Dimensional Tiled FDTD Kernel
Abstract
This paper introduces an automatic tuning method for the tiling parameters required in an implementation of the three-dimensional FDTD method based on time-space tiling. In this tuning process, an appropriate range for the tile size is first determined by trial experiments using cubic tiles. The tile shape is then optimized by using the Monte Carlo method. The tiled FDTD kernel was multi-threaded and its performance with the tuned parameters was evaluated on multi-core processors. When compared with a naively implemented kernel, the performance of the tuned FDTD kernel was improved by more than a factor of two.
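The two-stage tuning described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: `synthetic_cost` is a hypothetical stand-in for timing the tiled FDTD kernel, the tile is assumed to have only three spatial extents `(tx, ty, tz)`, and the candidate sizes, `trials`, `spread`, and `seed` are all assumed values chosen for the example.

```python
import random


def synthetic_cost(tx, ty, tz):
    """Hypothetical stand-in for a measured kernel run time (lower is better).

    Assumed model: a penalty for deviating from a preferred tile volume,
    plus a per-axis overhead that shrinks as the tile extent grows.
    In a real tuner this would run and time the tiled kernel instead.
    """
    volume = tx * ty * tz
    return abs(volume - 32 ** 3) / 32 ** 3 + 8.0 / tx + 2.0 / ty + 2.0 / tz


def tune_cubic(cost, sizes):
    """Stage 1: try cubic tiles and return the best edge length."""
    return min(sizes, key=lambda s: cost(s, s, s))


def tune_shape(cost, edge, trials=200, spread=4, seed=0):
    """Stage 2: Monte Carlo search over tile shapes near the best cube.

    Each axis is sampled uniformly from [edge // spread, edge * spread];
    the best cube is kept as the starting incumbent.
    """
    rng = random.Random(seed)
    lo, hi = max(1, edge // spread), edge * spread
    best = (edge, edge, edge)
    best_cost = cost(*best)
    for _ in range(trials):
        cand = (rng.randint(lo, hi), rng.randint(lo, hi), rng.randint(lo, hi))
        c = cost(*cand)
        if c < best_cost:
            best, best_cost = cand, c
    return best, best_cost


edge = tune_cubic(synthetic_cost, [8, 16, 32, 64, 128])
shape, cost_value = tune_shape(synthetic_cost, edge)
print(edge, shape, round(cost_value, 3))
```

Starting the random search from the best cube means the second stage can only match or improve on the first; replacing `synthetic_cost` with a function that actually times the kernel would turn this sketch into a usable, if simple, tuner.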
Similar resources
Performance Improvement of Three-Dimensional Tiled FDTD Kernel Based on Automatic Parameter Tuning
This paper introduces an automatic tuning method of the tiling parameters required in the implementation of the three-dimensional FDTD method based on time-space tiling. The tuned tiled FDTD kernel was multi-threaded and its performance was evaluated on a multi-core processor. Compared with a naïvely implemented kernel, this tuned FDTD kernel performed better by more than a factor of two.
A Fast and Automatic Kernel-based Classification Scheme: GDA+SVM or KNWFE+SVM
For high-dimensional data classification such as hyperspectral image classification, feature extraction is a crucial preprocessing step for avoiding the Hughes phenomenon. Some feature extraction methods such as linear discriminant analysis (LDA), nonparametric weighted feature extraction (NWFE), and their kernel versions, generalized discriminant analysis (GDA) and kernel nonparametric weighted featur...
A New Computer-Aided Detection System for Pulmonary Nodule in CT Scan Images of Cancerous Patients
Introduction: In lung cancer, a computer-aided detection system that is capable of detecting very small glands in a high volume of CT images is very useful. This study provided a novel system for detection of pulmonary nodules in CT images. Methods: In a case-control study, CT scans of the chest of 20 patients referred to Yazd Social Security Hospital were examined. In the two-dimensional and ...
Tools for Performance Optimizations and Tuning of Affine Loop Nests
Multicore processors have become mainstream and the number of cores in a chip will continue to increase every year. Programming these architectures to effectively exploit their very high computation power is a nontrivial task. First, an application program needs to be explicitly restructured using a set of code transformation techniques to optimize for specific architectural features, especial...
Video analysis based on Multi-Kernel Representation with automatic parameter choice
In this work, we analyze video data by learning both the spatial and temporal relationships among frames. For this purpose, the nonlinear dimensionality reduction algorithm, Laplacian Eigenmaps, is improved using a multiple kernel learning framework, and it is assumed that the data can be modeled by means of two different graphs: one considering the spatial information (i.e., the pixel intensit...